##############
###Datasets###
##############

This Harvard Dataverse entry includes the following two datasets (in CSV format):

RALMevent.csv: Individual incident (i.e., event) level data on radical environmental direct actions associated with the North American radical animal liberation movement (RALM) during the years 1995-2007. A small number of additional events are coded and included for 1994 as well, though these are less comprehensive. For the dataset, each row corresponds to a discrete atomic event and each column denotes an event characteristic or original coding source characteristic. The variables included in this dataset are detailed in the accompanying dataset paper: "Historical Spatio-Temporal Data on North American Radical Environmental Direct-Action Events."

RALMaggregated.csv: A spatio-temporally aggregated version of RALMevent.csv. This second, aggregated dataset is intended to more readily enable large-N statistical analyses. It includes only events and years beginning in 1995 (thus omitting the small number of 1994 events mentioned above). All retained events were spatially aggregated to the 0.5 x 0.5 decimal degree grid-cell level for the US and Canada and were temporally aggregated to the yearly level for 1995-2007. A wide array of additional grid(-year) variables is also included in this aggregated dataset; these are discussed in detail in the accompanying dataset paper: "Historical Spatio-Temporal Data on North American Radical Environmental Direct-Action Events."
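
As a rough illustration of the grid-year aggregation described above, the following Python sketch assigns each event to a 0.5 x 0.5 decimal degree cell and counts events per cell-year for 1995-2007. It is not the replication code (the actual aggregation is done by the R and Stata scripts in this entry). The field names "year", "lat", and "lon", the sample coordinates, and the choice to identify a cell by its south-west corner are all assumptions made for this example.

```python
import math
from collections import Counter

CELL = 0.5  # grid resolution in decimal degrees


def cell_corner(lat, lon, size=CELL):
    """Return the south-west corner of the grid cell containing (lat, lon)."""
    return (math.floor(lat / size) * size, math.floor(lon / size) * size)


def aggregate(events, first_year=1995, last_year=2007):
    """Count events per (cell, year), dropping events outside the year window."""
    counts = Counter()
    for ev in events:
        if first_year <= ev["year"] <= last_year:
            counts[(cell_corner(ev["lat"], ev["lon"]), ev["year"])] += 1
    return counts


# Hypothetical events; field names and coordinates are illustrative only.
events = [
    {"year": 1994, "lat": 45.52, "lon": -122.68},  # dropped: before 1995
    {"year": 1996, "lat": 45.52, "lon": -122.68},
    {"year": 1996, "lat": 45.70, "lon": -122.90},  # same cell as the row above
    {"year": 2001, "lat": 40.76, "lon": -111.89},
]

cell_year_counts = aggregate(events)
for (cell, year), n in sorted(cell_year_counts.items()):
    print(cell, year, n)
```

Note that this sketch only produces rows for cell-years with at least one event; a full grid-year panel would also need zero-count rows for every other cell-year.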

#######################
###Aggregation Files###
#######################

Creating the data contained in RALMaggregated.csv required a series of additional dataset inputs, aggregation scripts, and merge scripts. These files are contained in the "Merge Data" subfolder of this Harvard Dataverse entry. In particular, this subfolder includes:

-A number of raw input dataset files that are aggregated, formatted, and eventually merged together, along with associated intermediate data files created during these steps. Each associated file is stored as either a .csv or .dta file within this folder.

-A series of initial aggregation scripts used for aggregating relevant variables over time, actors, and/or by lat-long coordinates. Env_Group_Evts_agg.R aggregates RALMevent.csv and the associated coded data on RALM groups. The various Event.Aggregator.[...].R files aggregate relevant Phoenix events based upon the events' original coding (media) source and a target event pairing, such as government-to-civilian events or civilian-to-any events.

-A series of intermediate aggregation scripts that (i) combine the aggregated Phoenix event data into appropriate CAMEO quad categories (CombineThreeDigitPhoenix.do) and (ii) combine and rename several PRIO-GRID specific (ID) variables for subsequent merging (CombinePrioGrid.do).

-A series of merge scripts that use KNN matching to associate each aggregated event sum or (ID) variable value with its appropriate grid cell (and GID) based upon (i) that event sum or variable's latitude-longitude coordinates and (ii) each grid cell centroid's latitude-longitude coordinates. These scripts include: gidmatch_stateID.R, gidmerge_incident_groups.R, gidmerge_mainstream_disasters.R, and gidmerge_PHX_events.R.

-A final merge script that combines all aggregated data and variables to the GID-year level, in addition to further formatting, aggregating, and renaming several variables (Merge_Final_Dataset.do). Note: this file generates the final aggregated dataset mentioned further above: RALMaggregated.csv.
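
The centroid-matching step performed by the merge scripts listed above can be sketched as follows. This Python example is only an illustration under stated assumptions, not the code in the gidmerge_*.R scripts: it assumes k = 1 (each point is matched to its single nearest centroid), uses squared Euclidean distance on raw latitude-longitude values, and relies on made-up GIDs, centroids, and event sums.

```python
def nearest_gid(lat, lon, centroids):
    """Return the GID whose centroid is closest to (lat, lon).

    Assumes k = 1 and squared Euclidean distance in decimal degrees.
    """
    return min(
        centroids,
        key=lambda gid: (centroids[gid][0] - lat) ** 2
        + (centroids[gid][1] - lon) ** 2,
    )


# Hypothetical grid cell centroids keyed by made-up GIDs: (lat, lon).
centroids = {
    101: (45.25, -122.75),
    102: (45.75, -122.75),
    103: (45.75, -123.25),
}

# Hypothetical aggregated event sums: (lat, lon, event_sum).
points = [
    (45.52, -122.68, 3),
    (45.70, -122.90, 1),
    (45.10, -122.60, 2),
]

# Attach each event sum to its nearest cell and total the sums per GID.
totals = {}
for lat, lon, n in points:
    gid = nearest_gid(lat, lon, centroids)
    totals[gid] = totals.get(gid, 0) + n

print(totals)
```

A brute-force search like this is adequate for a small example; with many thousands of cells and points, a spatial index or a dedicated nearest-neighbor routine would be the more practical choice.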


####################
###Visualizations###
####################

The datasets discussed above were used to create a number of visualizations within the accompanying dataset paper: "Historical Spatio-Temporal Data on North American Radical Environmental Direct-Action Events." All files for replicating these visualizations appear in the "Vizualizations" subfolder of this Harvard Dataverse entry. In particular, this subfolder includes:

-Versions of the main datasets discussed above (RALMevent.csv and RALMaggregated.csv), which are the primary inputs needed for all visualizations.

-A host of spatial data files for both the US and Canada. These files, and in particular the .shp files, are used in creating the heatmaps reported in Figures 6-7.

-Two R scripts that allow one to directly create each relevant visualization after reading in the input files discussed above: Figures1-5.R and Figures6-7.R.


##########################
###Additional resources###
##########################

All R-based outputs were generated using R version 4.3.1.

All Stata-based outputs were generated using Stata 11.

Further details on the datasets, variables, and the methods used to produce these data can be found in the accompanying dataset paper: "Historical Spatio-Temporal Data on North American Radical Environmental Direct-Action Events."

For questions, please contact Benjamin Bagozzi at bagozzib@udel.edu.

